Collocation and term extraction using linguistically enhanced statistical methods
نویسنده
چکیده
منابع مشابه
You Can't Beat Frequency (Unless You Use Linguistic Knowledge) - A Qualitative Evaluation of Association Measures for Collocation and Term Extraction
In the past years, a number of lexical association measures have been studied to help extract new scientific terminology or general-language collocations. The implicit assumption of this research was that newly designed term measures involving more sophisticated statistical criteria would outperform simple counts of cooccurrence frequencies. We here explicitly test this assumption. By way of fo...
متن کاملAn Extensive Empirical Study of Collocation Extraction Methods
This paper presents a status quo of an ongoing research study of collocations – an essential linguistic phenomenon having a wide spectrum of applications in the field of natural language processing. The core of the work is an empirical evaluation of a comprehensive list of automatic collocation extraction methods using precision-recall measures and a proposal of a new approach integrating multi...
متن کاملFuzzy Set Theoretic Approach To Collocation Extraction
Fuzzy approach deals with the linguistic properties of elements such as beauty, coldness, hotness etc. Collocations are linguistically motivated. Decision of word combination for being collocation is a linguistic term as merely co-occurrence of word combinations does not signify the presence of collocation. Thus collocation extraction can be made possible by looking its linguistic aspect. In th...
متن کاملAutomatic Term and Collocation Extraction from English-Croatian corpus
Term and collocation bases represent valuable additional resources covering specific domain and frequently expressions, which then can be used in further research. The paper presents possible model of building terminology and collocation base, using statistical and linguistic approaches in order to gain experience in building of such resources for the English Croatian language pair. The aim of ...
متن کاملText Mining for the Extraction of Domain Relevant Terms and Term collocations
The domain adaptation capability of information extraction (IE) systems relies on automatic acquisition of domain specific knowledge. The domain specific knowledge contains domain relevant terms, semantic relations for ontology building, or lexicosyntactic patterns for template filling [Riloff & Jones 1999 and Yangarber et al 2000]. Recently, an ever-growing interest in automatic term extractio...
متن کامل